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AN IMITATION OF LIFE 


‘An automaton is a machine that is able to operate on its own, following 
a sequence of programmed instructions. Historically, most automata 
were animated toys—often clockwork figures or animals, same of which 
were surprisingly lifelike, Animatronics, which are typically used to 
portray film or theme-park characters, are 
modem electronic automata 

In Al, the word “automaton” refers to 
a computer that can be programmed 
to perform a specific task, such as 
forecast the stock market 
or analyze customer 
behavior. The latest 
Alsare highly 
sophisticated, and 
appear to have minds 
of their own. However, 
‘one has yet to be built that 
can control its own actions. 


‘ANDROID 


‘An android a automaton 
that has been designee to 
‘mimic human behavior. 
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DEFINING 
INTELLIGENCE 


English mathematician Alan Turing 
(1912-54) devised a test that can be 
used to establish whether a machine 
has humanlike intelligence (see 
pp.130-31), Originally, the Turing test Rage 
focused on numerical intelligence (the mrctienice 
ability to perform mathematical 
calculations). However, scientists now 
argue that since there are different 
kinds of intelligence (such as artistic 


intelligence for it to be considered the 

equivalent of a human being. Broadly 
speaking, there are eight kinds of 

intelligence, including sensory 

intelligence (the ability to interact 

with one’s environment) and reflective 

intelligence (the ability to reflect upon 

and modify one’s behavior). 


«1 know that! | 
am intelligent, 
because | know 

that | know 
nothing.” 


L Sociates J 
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THINKING = COMPUTING 


The idea that all thinking, whether human or artificial, is. 
a form of computing (see p.15)—specifically, a process of 
using algorithms to convert symbolic inputs inta symbolic 
outputs (see p.36)—is known as “computationalism.” 
Computationalists argue that the human brain is a 
computer, and that one day an Al should therefore be 

able to do anything that a brain can do. In other words, 
‘they claim, such an Al would not merely simulate 
‘thinking—it would have genuine, humanlike consciousness. 


INPUT OUTPUT 
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ZEROS AND ONES 


A binary code is ¢ code that reps Ths: 
information, stich as instructions, using only. 
twornumbers, or digits. The binary code most 
‘commonly Used In:compultino features the 
numbers O and 1, each of which represents 

@ bit” of information, Any numiber can be 
convertedt into zeros.and ones (for example 
the decimal number 12 is 1100 in binary), as 
‘canaiiy-lettiee of ahy known alphabet. The 
tno digits can also represent the two states of 
‘an electrical current’~"on", or “off”—-meaning 
thats oflware transiated into binary code can 
be thad bya compute 


BINARY CODE 


anoutput. 


Processing 
‘An algorithin 
consists of 2 

series of steps that 
sequentially process 
‘the input data to 
give a desired 
output. 


The outputs any 
information ot 
data produced by 
the algerithen. 


STEP BY STEP 


An algorithm is a sequence of instructions for 
accomplishing a task. It takes an input, such as information 
or data, and processes it in a series of steps to produce a 
desired result, or output. The task or process can range from 
a simple calculation, or following a recipe to make a meal, to 
solving complex mathematical equations, An algorithm is an 
‘example of what mathematicians call an “effective method.” 
which means it has a finite number of steps and produces 
a definite answer, or output. 
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‘Components 
of calculation 
Computations have 
aninputand an 
‘output, and multiple 
steps. They ean vary 
‘rom simple sumeto 
‘complex equations. 


ALGORITHMS IN ACTION 


A computation is a calculation that follows the steps of an 
algorithm (see opposite). The most straightforward example of 
computation is arithmetic calculation. For example, if you add 
‘together 2 pair of three-digit numbers in your head, you follow 
a series of steps, or an algorithm, to achieve this calculation. 
‘Computations use symbols to represent numbers, but symbols 
can represent almost anything else (see p.36). With the right 
symbols and the right algorithms, immensely complex 
computation becomes possible. 
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MULTITASKING 


‘program can 
run multiple sets 
‘of instructions at 

the sometime 


ourPur 


INSTRUCTING 
COMPUTERS 


A program is a sequence of instructions written in code 
that enables a computer to perform one or more tasks. 
Charles Babbage (see opposite) imagined the first program 
He was inspired by the design of a certain silk loom, which 
had parts that moved up or down in response toa pattern 
of holes punched into a card. Babbage recognized that these 
holes could store instructions to operate the cogs and levers 
of a machine he was designing: the “Analytical Engine” 
Modern computers work on the same principle, following 
sequences of instructions, which are usually written 
in binary code (see p.13). 


PROGRAMS 


ANIC 
ect AL Co 
In the 19th century, a 


the complex work of producing 
numerical tables (used in navigation, 


« A 
x warfare, and other fields) was performed by % 
KR w 


people known as “computers.” To avoid mistakes 
caused through human error, English mathematician 
Charles Babbage (1791-1871) invented what he called 
the "Difference Engine”—a machine that could perform 
mathematical calculations mechanically. Babbage then 
designed the “Analytical Engine’—a general-purpose 
calculator that could be programmed using punched 
cards (see opposite), and had separate memory and 
processing units. Although it was never built, the 
Analytical Engine had many of the key features 
of modern computers (see p.22). 


BABBAGE’S MACHINES 
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which way to moveand 
ether to replace the digit. 
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A THEORETICAL COMPUTER 


In 1936, English mathematician Alan Turing (1912-54) 
proposed an imaginary machine that could solve any problem 
that could be made “computable” (see p.15). In other words, 
as long as the problem could be written using symbols and 
algorithms, and translated into binary code (see p.13), his 
machine could solve it. The device consisted of a head that 
moved over a tape marked with binary information. Although 
it was never built, Turing’s Universal Machine sparked the 
computer revolution by proving that a machine could tackle 
any computable problem. 


Problem-selving machine 
‘A “readjwrite” head moves back and, 
forth along a paper tape. Following 
Instructions from an algorithm, it 
changes 1s to 0s, and vice versa, 
depending on what has come before, 
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AN ELECTRIC BRAIN 


Alan Turing (see pp.18-19) had shown that machine 
could carry out any computation (see p15) with the right 
combination of symbols. In 1943, scientist Walter McCulloch 
(1898-1969) and mathematician Walter Pitts (1923-69) 
demonstrated that networks of units based on human 
nerve cells, or neurons, passing electric signals back and 
forth, could copy a Turing machine. They suggested that 
the brain might be a kind of living computer, meaning 
that a program that ran on the human brain might also 
Tun on an electric brain. This theory is known as the 
principle of “multiple realizability” (see p.136). 
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A PROGRAMMABLE 4 
COMPUTER 


The Electronic Numerical Integrator and Computer 
(ENIAC) was an early electronic computing machine built in 
the US between 1943 and 1946, Made up of over 18,000 
vacuum tubes (electronic components resembling light 
bulbs) and covering 1,800.sqft (167 sqm), it calculated range 
tables (a list of the angles and elevation needed to hit a 
target) for artillery, giving its answers on paper punchcards. 
In just 20 seconds it could complete a calculation that took 
people hours using clectromechanical calculators, ENIAC 
__as programmed by changing the arrangement of cables 

4 


"¢, 


‘that plugged into it, which took days to complete. It was the > 
first machine computer that could run different programs. = 


Making the best move 
‘Turochamp calculated all of the 
responses it could make, gave 
point values to these, and then 
selected the highest-value move. 


vF 


A THEORETICAL PROGRAM 


In 1948, Alan Turing (see pp.18-19) and mathematician 
David Champerowmne (1912-2000) set out to prove that, with 
the right algorithm, a computer could play a game of chess. 
‘At the time, no electronic computer existed that could run 
such an algorithm, so Turing played the role of computer 
himself, performing each step of the algorithm on paper. 
“Turochamp,” as they called it, was further proof that 
computers (whether human or artificial) could perform 
complex calculations without understanding what they 
were doing, but simply by following a set of instructions. 
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A COMPUTING 
BLUEPRINT 


John von Neumann (1903-57) was a 
Hungarian-American scientist involved 
In developing ENIAC (see p.22), the first 
programmable computer. He devised a 
model (see right) that established how 
the main components of modern-day 
computers are structured—known as 
von Neumann architecture, The major 
advancement was the use of a memory 
unit that contained both the programs 
(See p.16) and data (see p.32), making the 
machines quicker and easier to reprogram 
than existing ones. Information within 
the memory unit feeds into a central 
processing unit (CPU). Within the 

CPU is a control unit that decodes the 
program into instructions, which are 
enacted by an arithmetic and logic unit 
(ALU), using data to perform calculations 
and tasks, The results of these are then 
fed back into the memory unit. 


r 


| inputdevices, such a5 
3 | a teyboardand mouse, 
HE cnable user to input 

data intothe machine, 


INPUT 
DEVICE 


Structural advantage 
‘This diagram shows von 
Neumann's architectuce, 
Because the memory units 
could be upgraded, the 
‘machines could be made 
faster and more powerful 


- 
“Any computing machine that is 
to solve a complex mathematical 
problem must be ‘programmed’ 
for this task.” 


john ven Neumann 


al 
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CENTRAL 
PROCESSING 
UNIT (CPU) 


CONTROL 
UNIT 


ARITHMETIC 
AND LOGIC 
UNIT (ALU) 


MEMORY 
UNIT 


The 
‘unit and ALU, and links tothe 
Input and output cevces 
DATACONTROL, 
Thiseontrols te flow 


cof data within the CPU 
‘and instructs the ALU. 


OUTPUT 
DEVICE 


Output devices, such asa 
‘monitor or printer, enable 
| vsors to view the data 


“This folowsinstructions 
from the control unit 
and processes the data 
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Inputs can come 
from sensors 

such as cameras, oF 
fom a controle’ 
leet instructions 


—~-a 


Agent interaction 

‘An intelligent agent reacts to 

and affects the environment 
around it 


Sensors 


4AN39V 


Se / 


Any device that affects | 


Al IN ACTION 


An “intelligent agent” in Al is anything that can sense, 
respond to, and affect its environment—which can be 
physical or digital. Examples include robots, thermostats, 

and computer software programs. The agent has “sensors,” 
which it uses to perceive its environment, and “actuators,” 
which it uses to interact with its surroundings. The action the 
agent takes depends on the specific goals that have been set 
for it and on what it senses. Some agents can learn (see 
pp.58-59), so that they are able to change the way 
they react to conditions within their environment. 


INTELLIGENT AGENTS 
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TRIAL AND ERROR 


Machines that can follow 
simple instructions, such 

as calculators that apply 
mathematical rules, have 
existed for decades. 

Creating machines that 

can “lean”—the basis of 
modern Al—is far more 
recent and complex. To do so, 
programmers use algorithms. 
(see p14) that are repeatedly 
revised through trial and error 
‘to improve their accracy. 

Like natural evolution, the 
improvements made are 
gradual and incremental. As 
Als become more advanced, 
they are able to contribute 
to their own learning, 
although currently they 
require human assistance. 


Improved accuracy 
Teaching machines te 
learn maans making 
them more accurate 
and more reliable. 
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MIMICKING THE BRAIN 


Connectionism is an approach to A\ in which information is represented 
not by symbols but by patterns of connection and activity in a network, 
‘These patterns are known as “distributed representations,” and 
computation that is done in this way is known as “parallel distributed 
processing” (PDP). Connectionists believe that intelligence can be 
achieved by taking simple processing units, such as artificial neurons 
(see p.21), and connecting them together into huge “artificial neural 
networks” (ANNS, see p.76) to allaw PDP. As its name suggests, the 
connectionist model is based on how the brain works—using parallel 
processing across interconnected networks of cells, or neurons. 


OUTPUT 
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CLASSICAL AI 


Deduction 
Clacieal Al mimic 
human logic: they 
‘answer questions 
by following strict 
mathematical rules, 


Wl 


Al MODELS 


‘The earliest forms of Al are now known as classical (or symbolic) Als. 
They were constructed according to the top-down approach, in which 
computer designers first figured out the rules of symbolic reasoning — 

how humans think—and built them into the Als. Their performance was 
always limited by the rigid application of human-derived rules and their 

programmers’ understanding of them. In contrast, modern statistical Als 

are constructed according to the bottom-up approach. They are provided 

with masses of data and machine-learning tools (see pp.58-59) that 
‘enable them to find patterns in the data. From these patterns they are 
able to build models that show how particular systems (such as financial 
markets) operate under particular conditions, 


STATISTICAL AI 


Modeling 
Statistical Als use 
‘machine-learning 


toolsto construct 
models of how 
systoms work 
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Intel microprocessors 
Moore's prodiction was berne 

‘out closely until the 1990s. By the 
‘time the Pentium microprocessor 
2ppeared (replacing the 80406. 
chip), the rate of increase in 
processing power was already 
slowing down, 


Predicted rate 
of increase. 


NUMBER OF TRANSISTORS, 


History 


COMPUTING POWER 


Moore’s Law is named after Gordon Moore (1929-), the cofounder 
of integrated circuit chip-maker Intel. In 1965, Moore predicted that 
the number of transistors that could be fitted onto 2 computer chip. 

would double every two years. Due to advances in technology, 

particularly miniaturization, this prediction was barne out for decades, 
and although it has since slowed down, computing power is still 
increasing each year. This means that in the foreseeable future, 

if computationalism is correct (see p.12), Als will have the same 

‘amount of computing power as the human brain. 
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RAW INFORMATION 


Data is information that can take many forms, such as numbers, 
words, or images. In computing, data is a sequence of symbols that is 
collected and processed by a computer according to its pragramming. 

In modern computers, these symbols are the 1s and 0s of binary—or 
digital—code (see p.13). This data |s either “at rest” (stored physically 
in a database), “in transit” (being used for a finite task), or “in use” 

(constantly being updated), and it can also be shared between 

computers. Data is classified according to whether it can be 
measured and how this is done. 


‘QUALITATIVE 


'Non-numerical data, which 
can be words, images, or 
audio. It cannot be measured 
‘and must be interpreted 
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QUANTITATIVE 
Numerical data, which can 
be counted and statistically 
analyzed. It can be measured 
and is objective, 


Wy ee @ 
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EVERYTHING, EVERYWHERE, 
ALL OF THE TIME 


"Big data” is a phrase that describes data sets that are too large 
to be pracessed by traditional forms of data-processing software. 
Such data sets include massive amounts of information about people, 
their behavior, and their interactions. For example, mobile phone 
‘companies use their customers’ phones to track the movements of 
billions of people, every second of every day, and they record this 
information in vast data sets. Big data is widely used in Al, from training 
machine-learning models (see pp.58-59), making predictions 
about the weather or future customer behavior (see pp.70-71), 
to protecting against cyherattacks (see p.97), 
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CLASSIC 
ARTIFIC 


INTELLI 


From the 1950s to the 1990s, the dominant paradiam 

in Al research was classical (or “symbolic” or “logical”) Al 
This approach to Al was based on logical reasoning, using 
symbols and rules—written by human programmers—to 
represent concepts and the relationships between them. 
Classical Al had many successes, including Als that could 
play games, hold basic conversations, and answer queries 
using "expert systems.” Although statistical Al has since 
‘overtaken classical Al, the old approach has not been 
entirely abandoned; many of its techniques have been 
incorporated into modern Al applications, such as natural 
language processing and robotics. 


REPRESENTING DAT, 


In Al, a “symbol"is a graphical representation 
of a real-world item or concept—a simple type of 
symbol is a picture. A symbol can also be a group 

of other symbols, such as the letters that make 
up the name of an object. In classical Al, symbols 
‘embody the total sum of the relevant facts and 
information required for the system to understand 
what something is, To achieve this, data is labeled 

(ee pp.62-63) and attached to a symbol. The 

symbol for an apple would include a wealth of 

data stating what an apple is and is not. 


SYMBOLS IN Al 


FOLLOWING THE RULES 


Logic is the study of sound reasoning, and of the rules 
that determine what makes an argument yalid. In practice, logic 
enables people to take statements about the world (known as 
premises) and derive new information from those statements (known 
as conclusions). Als are programmed to follow strictly logical rules, 
with the aim of producing reliable conclusions, One such rule is the 
syllogism, which states: “If all As are Bs and all Bs are Cs, then all 
‘As are Cs.” This simple principle enables Als ta know that all items 
of a particular class will always have a particular characteristic. 


Syllogistic logic 

An Al that understands that fruit ic healthy, 

and that an apple's a fruit, also knows that 
apples are heaitny 


PREM 
APPLES ARE FRUIT 


ONCLUSION 
APPLES ARE HEALTHY 


FRUIT IS HEALTHY wy 


pe 
KC 
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WHAT, WHEN, WHY, 
AND HOW? 


Al systems use up to five kinds of knowledge in their 
interactions with the world, but only two are common to 
all Als. Declarative knowledge is the most basic form and 
describes statements of fact, such as "cats are mammals,” 

whereas procedural knowledge instructs Als how 
to complete specific tasks. In some Als, meta-, heuristic 

(see p.43), and structural knowledge provide further 

information that enables them to solve problems. 
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PRESENTING 
KNOWLEDGE 


In order for an Al to Cogien cepresectatot 


i Statements of information are clear, 
understand information logical, and unambiquous. 


correctly, the information 
must be presented to it 
very clearly. There are four 
main ways of doing this. 
“Logical representation” 


pases inet naaor slid ‘Semantic representation 

the exact words of a natural The relationships and connections between 
language (or symbols to facts within the information are made leer 
represent them). “Semantic 

representation” ensures that. iA isa 


the individual meanings a Gay aa 


within the information are 
connected in a formal, logical site 
way. "Frame representation” 


involves presenting the Four) sama Frame 
information in a tabular - representation 
format, with facts allocated fa lie ites a 
: ore ee) presented as simple 
to individual “slots.” Finally, <a] RMeEnee ve 
“production rules” are the bene slots contain details 
instructions that state what cuir 
conclusions an Alcan deduce pod. tion ules 
from the information it is When an "If" statement is true, 2 “THEN 
supplied with (see p.37). statement can be deduced trom i 
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IFSTATEMENTS. 


+ 


IF THIS, THEN THAT 


A.rule-based Al system uses instructions, consisting of “IF-THEN* 
statements, to draw conclusions based on an initial set of facts. 
In its simplest form, an IF-THEN statement says to the system: 
““f this condition is true for the current facts, then do this; if it 

is false, do nothing.” Adding an “ELSE” option allows for more 
complicated statements: “If this is true, then do this; otherwise 
(else), do that.” Rule-based systems are predictable, reliable, and 
“transparent,” meaning it is easy to see which rules the Al applies. 
However, rule-based Als cannot “learn” by adding to their store 
of rules and facts without human intervention. 
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THE SHORTEST ROUTE 


Pathfinding algorithms are search algorithms that are used to find 
the shortest route between two points. They have many uses, 
including vehicle navigation and computer gaming. The algorithm 
is programmed using a weighted graph (see below) that shows all 
of the possible paths available. The circles, or “nodes,” represent 
vaypoints, or special locations, which are joined by lines known as 
“edges.” Programmers add a weight (see p.78) to the edges, which 
reflects a “cost,” such as distance or time. The algorithm 
calculates the weights to find the shortest path. 
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IMPERFECT SOLUTIONS 


‘Some problems can be too complex for an algorithm to solve quickly. 
In such cases, an Al can do a “brute-force search,” which means to 
methodically work through and evaluate every possible solution. This 

is slow, however, and in some cases impossible. A more efficient 
alternative is to use a “heuristic.” This practical method uses a 
‘common-sense approach, searching for an approximate solution by 
estimating a “good enough’ choice at every decision point based on 
the information available. 
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Manhattan distance 

‘The Manhattan distanes heuristic maps 
routes by calculating squares moved 

Vertically and horizontally tcan be used 
to plot o path inan area witha grid 

system, such as Manhattan in New York, 
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PERFORMING A TASK 


Embodied Als (see p.118), such as robots, use a technique known as 
“planning” to help them solve practical problems. Planning involves 
understanding the environment or lacation in which the task must 

be performed and mapping out the actions required to complete it. 
The Al must identify each step required to fulfill the task and the 
coptimal—lowest cost—sequence in which to perform them (see p.42). 
If the optimal sequence is not possible, for whatever reason, it must 
also be able to decide the next-best alternative (see p.43). It must 
also identify and avoid any actions that would prevent it 

from completing its task. 


Planning ahead 
Inorder to complete ts 
‘ash, the robot brea! 

down into.a sequence 


f 


indus steps 
a 1 

2 : j a 

S 
— 

1.tdentify box 2. Locate amp 

Therobet'sintal Tectetaees 

omntexo fide Tamp to geronee 

Shue bexnits ee 

ee iedentes tl 


44 | PLANNING AND Al 


ENVIRONMENT 
“The robot must climb onto the 
platform to reach the blue box. 


3. Push ramp 
‘The ramp needs to. 
be adjacent to the 

platform, 30 it must 

be pushed into place, 


e 


4. Ascend ramp 
‘The robot can now 
Use the ramp to move 
up onto the platform 
next to the blue box, 


Task 
The robot's 
ig to push the blue 
box off the end of 
the platform. 


5. Push block off 
Once on the platform, 
the robot can push the 
box off the end. Its 
tackisnow complete, 
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Bayes’ theorem | Theprobabilty of event A 
‘The probability of one event 3 happening given that event 
happening—such at smoke B hes happened. For 
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frequency of smoke and fires. 


DEALING WITH 
UNCERTAINTY 


Most classical Als are based on the idea that logical statements 
{see p.37) are either true or false—that there is no room for 
Uncertainty. However, uncertainty is an unavoidable feature of life, 
and it can be incorporated into Als using the cancept of probability. 
Probability is a numerical value of how likely something is to occur, 
“Probabilistic reasoning” is any method of reasoning that takes 
probability into account. The English statistician Thomas Bayes 
(1702-61) developed a method, known today as Bayes’ theorem, of 
calculating the likelihood of an event happening. Instead of figuring 
‘out the probability of the event in isolation, Bayes’ theorem bases 
probability on prior knowledge of the relevant conditions. 
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A Markoy chain is a model that describes a sequence of 
possible events in which the probability of each event 
depends on the state that was reached in the previous event, 
The model predicts outcomes based on the rules of 
probability (see pp.46-47) and using data collected on the 
relevant subject. Once it has been trained (see p.61), it only 
needs to know the conditions of the immediate past (the 
previous state) to get the relevant information to predict 
the likelihood of the next state. Markov chains have many Al 
applications, from forecasting weather patterns and financial 
market conditions, to use in predictive text systems. 
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Deterministic models Stochastic models 
Ina deterministic model, ‘Astochastic model includes 
there areno random variables, random variables, Results are 
Results from a set of inputs will much less predictable and not 
be related in a predictable way. clearly related to each other 


MODELING UNCERTAINTY 


Stochastic models enable Als to make predictions about 
processes and situations that are effected by chance 
‘events, such as changes in the stock market or the 
growth rate of bacteria, The volatile and ever-changing 
factors in these scenarios are represented by random 
variables, and each is assigned a value based on the 
probability of it occurring, A stochastic model then 
processes thousands of combinations of variables and 
produces a distribution curve that shows the probability 
of different outcomes under different circumstances 
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AUTOMATED ADVICE 


Computer programs that replicate the knowledge and reasoning 
‘skills of human specialists are known as “expert systems.” The 
information that they contain is supplied by human experts, and is 
programmed into the system by a "knowledge engineer.” Each system 
has three parts. The “knowledge base” contains the facts and rules 
used by human experts on the topic. The “inference engine” applies 
the rules to the facts in the knowledge base to deduce answers to 
queries posed by users. The “user interface” accepts queries from users 
‘and displays solutions found by the system. Expert systems are able to 
answer complex questions and provide users with wider access to 
expert advice. They are usad in many areas, including medicine, where 
they match symptoms to likely causes and appropriate treatments, 


Human experts Knowledge engineer 
Experte supply the ‘The expert system 
knowledge and rules is programmed by 3 
within the system. “knowledge engineer. 
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Frinteltigence is not the ability to store 
formation, but to know where to find it.” 


L Abert etnstein J 


User 

The user asks. question 
and gets an answer via 
the interface. 


Inaction 
‘The three sections of an 
expert system interact to 
provide answers to the user. 
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HANDLING 
“MESSY” DATA 


Classical Als (see p 30) struggle with 
some tasks that humans find simple. 
We can program computers 
for reasoning-based tasks, such 
as playing chess, but not for 
sensorimotor- and perception- 


based tasks such as catching 

a ball or recognizing a cat. wns? 
The Austrian-Canadian Tr} 

programmer Hans Moravec a 

(1948-) argued that reasoning i 

tasks are easy to teach to | Anat canessly 

computers because humans have iene 

already figured out the steps that i 

are required to complete them. | weulastruggeto 


In contrast, sensorimotor 

and perception activities involve 
unstructured, or “messy,” data 
that requires a lot of processing. 
For humans, these tasks are largely 
unconscious actions, refined over 
millions of years of brain evolution, 
but they are difficult to break down 
into a series of steps that a 
computer can follow. 
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| for Als, particularly 
| classical modes, 
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NEATS VS. 


In the 1970s, Al theorist Roger Schank (1946-) 
noted that there are two types of Al research, 
which he called “neat” and “scruffy” (see 
opposite). The neat approach, which has since 
become dominant. builds Als by programming 
computers to follow strict mathematical rules. 
These rules enable Als to distinguish between 
different types of data, and to analyze those 
data by using machine-learning algorithms 
(see pp.58-59). Artificial neural networks 
(ANNs, see p.76), for example, are a triumph of. 
the neat approach. 
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STAT IST 
ARTIFIC 


INTELLI 


In the 1990s many researchers grew frustrated by the 
shortcomings of classical Al, with its focus on logic and 
deductive reasoning, and began developing statistical 
techniques instead. This gave rise to statistical Al, which 
remains the main focus of Al research today, At the heart 
of this approach is a technique known as “machine 
learning.” Machine learning involves using data sets to train 
‘Al models (including models that mimic the human brain, 
known as “artificial neural networks”) to perform tasks 
without requiring an engineer to program them explicitly to 
do so. This approach is thriving today, due to the availability 
of powerful computer hardware and large data sets 


paTiFiciaL INTELLIGE Nc, 


‘Thisis the science: 
‘of developing machines 
that can act and make 
decisions “intolligently". 


AAACHINE LEARN 


Machine learning focuses. 
‘on training computers to 
‘perform tasks without 
the need for explicit 
programming. 


over LEARN 


Deep learning isthe 
‘most sophisticated type of 

‘machine learning, tt requires 

minimal human intervention, 
‘and uses computer models. 
known as “artificial neural 

networks” that are based 

con the human brain. 


Wis predicting the future 
isn’t magic, it’s artificial 
intelligence.” 


TEACHING Als 
TO THINK 


‘Machine learning is a form of Al that enables 
‘computer systems to learn how to perform tasks 
without being explicitly programmed to do so. 
Programmers can write algorithms that tell 
‘computers pracisely which steps to follow to 
complete simple tasks. However, for more complex 
‘tasks, such as recognizing faces or understanding 
spoken conversations, it is incredibly difficult for 
programmers to write the necessary algorithms, 
and this is where machine learning comes in. 
Machine-learning algorithms use collections of 
sample data, known as training data (see p61), to 
‘build models that make predictions or choices based 
‘on new data. There are many kinds of machine 
learning, including deep learning (p.86), in which 
Als mimic the structure and behavior of biological 
brains by using artificial neural networks (see p.76), 
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=) GAINING INSIGHT 
] FROM DATA 


Data mining is a process of 
uncovering patterns of information in 
large data sets (sets of information) 
with a view to making the date useful 
for specific tasks. For example, 
data-mining software can scan the 
medical profiles of thousands of 
people to identify those diagnosed 
as being diabetic, and could inform 
them about new treatments for the 
condition. Data mining is a broad 
discipline that increasingly uses Al 
techniques to process volumes of 
data that are too large for humans 
to handle. Two key techniques used 
are “clustering” (see p.68) and 
‘anomaly detection” (see p.69) 
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VALIDATION DATA. 


TRAINING DATA TEST DATA, 


v usine Data 
Training data is used to 
teach an Al, during which 
PREDICTIONS validation data is used to 
monitor its accuracy. 
After teaching, the AL is 
assessed using test data. 


TEACHING MATERIALS 


Training data is a type of data that is used during machine learning 
(see pp.58-59) to teach Als how to perform tasks accurately. It is used 
by programmers to test, adjust, and fine-tune the Al (see pp.78-79) 
Until it gives the expected results—or outputs. “Validation data” may 
also be used to assess how accurately the Al processes the training 
data during the learning period. Once the Al has been trained, “test 
data’ is then used to assess the accuracy of its results. Machine 
learning requires a large amount of training data, which may 
be labeled or unlabeled (see pp.62-63). 
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FEATURES 
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Tagging features 
Ahuman operator tags all of the 
doteribing the features of “typo A 
apples. The Al learns that. together. 
these features definea “tyne A” apple, 
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GIVING DATA MEANING 


A “feature” is a characteristic, such as a pattern of pixels, that 
an Al can use as an input to predict a label, which becomes the 
output. In supervised machine learning (see p.72), Als learn to 
associate particular features with labels by processing training 
data sets (see p.61) that have already been labeled by a human 
operator. For example, if an image-recogrition Al trained with 
labeled photographs of animals is input a photograph of an 
animal with features such as white feathers, curved beak, 

and crest, it will probably output a label of “cockatoo.” 
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LABELS 


FRUIT DATABASE 


Taste: sweet, sharp 
Scent: fresh, tart 
Size? inchos/Sem 
Leaf: curved 

‘Skin: smooth, red 
Seeds: inedible 
Flesh: firm juley 


Predicting labels 
Knowing all of the features of 3 

‘type A” apple, the Alcan find it 
ing fruit database, and identify 
it with the labal “Apple A” 


“A baby learns to crawl, walk, and 

then run. We are in the crawling 

stage when it comes to applying 
machine learning.” 
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YES OR NO? 


‘A decision tree is a model of the decision-making process 
used by Als. It works by questioning data, to which the answers 
can only be “yes” or “no”. One kind of decision tree |s the 
“classification tree." By repeatedly posing yes/no questions, 
the Al splits the “root” (data set) into ever-smalier “branches” 
(subsets) that share particular features, until a single “leaf” 
(conclusion) is reached, pinpointing a specific classification 
within the data. Decision trees are commonly used in both 
machine learning (see pp.58-59) and data mining (see p.60), 
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TYPES OF DATA 


An algorithm that assigns labels to items (see pp.62-63) 
and then sorts them into categories, or “classes,” is known as 
a “classifier.” Through a process of supervised learning (see 
p72), Als are taught to classify items using a labeled training 
data set (see p.61), from which they learn to recognize the 
patterns associated with different labels. For example, 2 
spam filter is taught to detect features of spam and 
non-spam emails from a collection of labeled emails. Based 
on this training data, the Al can automatically assign the 
labels "spam” or “not spam’ to new emails, 


Dividing data sets DATA CLASS A 
Classifiers can separate 

data nto regions divided 

byalline known as a 

decision boundary. 
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Regression analysis is a machine-learning process 
(see pp.58-59) in which an algorithm is used to predict the 
behavior of one or more variables depending on the value 
of another variable. It is used in many supervised learning 
applications (see p.72), particularly those that are designed 
to find causal relationships between several variables, For 
‘example, it can be used to predict what the next day's 
temperature will be given today's humidity, wind speed, and 
atmospheric pressure, and data about how all four variables 
have behaved in the past. “Linear regression” (see above) is 
the most common form of regression analysis, and is used 
particularly in the fields of finance and economics. 
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GROUPING DATA 


Clustering is the process of dividing a data set into a 
number of groups based on commonly shared features. 
It isan unsupervised machine-learning technique (see p.73), 
which means that it is performed by Als on raw, 
unlabeled training data sets. Clustering |s especially useful 
for gaining insights into human behavior. For example, a 
company may use it to sort its customers into distinct 
groups, based on their purchase historles, so that it can 
target them more effectively with promotions. 


4 Honing research 
‘Clustering enables 
researchers to target 
‘specific groups of items 
for further analysis. 
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Cluster analysis: 
Als can identity 
‘anomalies ina 
data set by using 
clustering (cee 
opposite). 
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Anomaly detection is the process of identifying unusual 
(or “anomalous*) data in a data set. That is to say that the Al 
looks for items that do not fit a particular pattern or model 
built from its training data. Many anomalies are caused by 

mistakes in the data, such as incorrectly inputted units, or an 

Inconsistency in the type of measurement used. in such 

cases, it is important to find the anomaly so that it can be 

corrected or removed from the data set, However, anomalies 
can also draw attention to serious problems that lie outside 
the data set, such as a software malfunction or the Al being 

hacked by cybercriminals (see pp.96-97). 
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THE MOST 
LIKELY 
OUTCOME? 


Machine-learning models 
(see pp.58-59) can make 
predictions by analyzing 
patterns in historical data 
In Al a prediction is the 
‘output from a model that 
forecasts the chances of 
2 particular outcome, For 
‘example, if a customer 
buys a certain item online, 
an Al can use data about 
past purchases—both 
from the customer and 
from others—to predict 
what other items they 
might want. Prediction in 
Al does not always involve 
anticipating a future event. 
It can also be used to 
make “guesses” about 
‘events in the past and 
present, such as whether 
a transaction is fraudulent 
(see p.98), orif an X-ray 
indicates the presence 

of disease (see p.102). 
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Accustomor purchases a product 
from an online vender—for 
example, a toothbrush. 
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Similar items: 


The Al identifies othor itome 
‘frequently bought alongside the 
product—both by the customer 


‘and others. 


&. 


se 
Been 


Similar profiles 


‘The Al compares the customer's 
profile to2 large number of 


other profiles to fing 
similar matches. 


Prediction 
‘The Al predicts and then 
recommends linked items that the 
‘customer may want—for example, 
toothpaste and mouthwash, 


Prediction 
‘The Al uses the purchase history 
of similar profiles to predict other 
Items the customer might be 
Interested in. 
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MACHINE LEARNING 
WITH “LABELED” DATA 


‘Supervised learning is a type of machine learning in which 

an Ais trained using a “labeled” training data set (see p.61). 
Input and output data is labeled by a human so that the Al 
can learn the relationship between them. The inputs, outputs, 
and the rule that relates them are collectively known as a 
“function”. During training, weights (see p.78) are adjusted 

‘to make the function fit the training data, The resulting 
function can be used to predict outputs based on new 

inputs. Supervised learning can be used for classification 

(see p.66) and regression (see p.67). 
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The Alis able to clacsify 
inputs — for example, 
different fruits - froma 
large, unlabeled data set. 
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Laboled data sot Test data Prediction 
The Al istrained using After training, the Al is On the basic ofits 
labeled data - for ‘tested on unlabeled training data, the Al 
‘example, that the inputs Input data to check predicts the label of the 
are apples. its performance. inputis an apple. 


MACHINE LEARNING 
WITH “RAW” DATA 


Unsupervised learning is used to discover hidden structures 

in raw, unlabeled data sets, Although Als do not understand 

the relevance of these structures, they may still have real-world 
meaning. This approach is useful in the early stages of data mining 
(see p.60), to find patterns in large, unlabeled data sets, which can 
then be subject to human interpretation. An in-between method, 
semi-supervised learning, uses partly labeled data sets, which gives 
better results than entirely unsupervised learning. 
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LEARNING FROM 
FEEDBACK 


Reinforcement learning is an approach to machine learning 
(see pp.58-59) in which an Al is taught to perform a task 
through trial and error, To achieve this, the Al is programmed 
to recagnize “rewards” and "punishments," meaning positive 
‘or negative feedback, depending on whether it succeeds or 
fails. The Al learns that succeeding Is good and falling is bad, 
and repeatedly attempts the task until it is rewarded. For 
‘example, an autonomous vehicle trained in this way 
(see p.122) will be punished—receive negative feedback— 
until it learns not to go through a red traffic light. 


FEEDBACK: 
REWARD OR PUNISHMENT. 


Trial and error 
The Al learns te succeed in a task 
through the consequences of ite actions. 
It will seek rewards and avoid 
punishment until the taskis completed, 
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MODEL B 


MODEL C 


PREDICTION 


WORKING TOGETHER 


Ensemble learning is based on the idea that combining the outputs 
of multiple machine-learning algorithms produces a better result than 
a single model can. Using two or more models that have been built 
and trained in different ways, for example with different data sets, can 
“cancel out” their individual weaknesses and generate more accurate 
predictions. Ensemble learning can be used to “teach” a particular 
model to improve its predictive performance, but also to assess a 
model's reliability and prevent a poor one from being selected. 
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THE Al BRAIN 


Artificial neural networks (ANNs) are machine-learning 
models based on algorithms (see p.14). Their structure is 
similar to that of the brain, consisting of interconnected 

nodes—artificial neurons—that are organized into multiple 

“layers”. The nodes within each layer receive, process, and 

send date to the next layer in the network, until an 
‘output, or result, is produced. Each node works like an 
individual microprocessor that can be reprogrammed 
to handle the data ina desired way. Using training data 
(see p.61), programmers can teach an ANN to “learn” 
how to give the expected results, or outcomes. 
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NETWORK STRUCTURE 


Artificial neural networks (ANNs) are structured in “layers’— 
collections of processing nodes that operate together. Data flows 
from the nodes in one layer to those in the next. The first layer always 
contains the “input,” or incoming data. Next Is at least one “hidden” 
layer, in which the processing takes place. These layers are hidden in 
the sense that their data is not visible to a user in the way that the 
network's inputs and outputs are, Finally, the resulting data arrives at 
the “output” layer. All ANNs share this basic structure, but some are 
more complex: recurrent neural networks (see p.85) generate 
connections between nodes in the next or in previous layers, while 
deep neural networks (see p.86) can have hundreds of hidden layers. 
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ASSIGNING IMPORTANCE 


Al algorithms include variables—mathematical values 
that can change—that determine how data is processed. 
‘within an artificial neural network (ANN, see pp.76). 
‘When designing and training an ANN, programmers can 
give these variables greater or lesser influence within 
the algorithm. This influence is known as “weight.” The 
more weight an input has, the greater its influence over 
the output. The “bias” (see opposite) determines the 
threshold at which variables become significant. 
Adjusting the weights and bias allows the ANN to 
be fine-tuned to give more accurate results. 
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An artificial neural network (ANN, see p.76) is made up ofF 
layers of “nodes,” which receive and process data, Before 
a node can pass information on to the next layer of 
nodes, its output data must reach a certain value. This 
value—essentially a numerical score set by the ANN 
designer—is known as the “bias.” The node can only 
activate" and pass on its output data once the bias has 
been met. If the node is not activated, that path of data 
transmission stops. Different biases can also be set to 
direct data to specific nodes on the next layer of the ANN. 
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Cost function 
The difference between the 
expected and actual outputs gives 
‘the mode's performance. The goal 
is for them to be equal. 


MEASURING SUCCESS 


The performance of a machine-learning model, such 
as an artificial neural network (see pp.76), can be evaluated 
by its “cost function.” This is a measure of the change that 

occurs during training between the actual outputs from the 

model and the outputs expected by the programmer. This 
difference, called the “cost,” is expressed as a number. The 

higher the number, the greater the gap between the real and 
the anticipated outputs, and the poorer the model. As the 
model learns, the cost reduces and performance improves. 
The training is complete when the cost is zero, or as close 

to zero as possible. 
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IMPROVING PERFORMANCE 


A machine learning model improves its performance by 
fine-tuning its settings. Instead of having to process huge 
amounts of data, the mode| can start at a random data point 
and then "nudge" its way toward a better solution. The 
algorithm that trains it to do this is known as the “gradient 
descent.” Each time the model adjusts its settings, the 
gradient descent rates its success using the “cost function” 
(see opposite). Plotting the gradient of the cost function on a 
graph reveals a curve. The model reduces its cost function by 
following the steepest downward slope, When the slope levels 
Off, the model is as good as it can be and it stops learning 


Trial and error 
Using feedback from a 
gradient descent algorithm, 
the model tweaks ite 
settings until it reaches 

Its best performance. 
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REFINING 
THE MODEL 


The "delta rule’—also known as the delta 
Jearning rule—enables a single-layer artificial 
neural network (ANN, see pp.76) to boost its 

performance by refining its settings. It makes use of 


"gradient descent” (see p.81) to identify the best 
choices for improving the model. As the model’s 
itpuit gets nearer to the expected output, smafler 
adjustments are certied out, until the outputs are as 
close as possible to each ottier. Backpropagation 
(see p.84)is a generalized form of the deita rule that 
applies to ANNs with any numberof layers. 
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A ONE-WAY 
NETWORK 


A "feedforward neural network” (FNN) is 2 simple artificial neural 
network (ANN, see p.76) in which information flows forward 
only—from the input layer, through the hidden layers, to the 

output layer. The connections between the nodes in en FNN do 
not form “feedback loops*—in other words, autputs are nat fed 
backward as inputs, as they are in a recurrent neural network 
(RNN, see p.85). The most basic form of a feedforward neural 
network is a single artificial neuron (see p.21), which can undergo 
machine learning using gradient descent (see p.81). 
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FINE-TUNING DATA 


Backpropagation is a type of algorithm that is used to train artificial 
neural networks (ANNs, see p.76), specifically feedforward neural 
networks (see p.83). It is known as backpropagation because it begins 
at the final (output) layer and moves in reverse towards the first (input) 
layer. During this process, nodes are reprogrammed by adjusting their 
weights (see p.78) and biases (see p.79), using gradient descent 
(see p.81) to find out whether increasing or decreasing them will 
produce better results. This has the effect of fine-tuning the ANN 
to produce more accurate outcomes overall. 
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STRUCTURED DATA 


A recurrent neural network (RNN) is a type of ANN in which data 
can move backward in a “feedback loop”. RNNs are used to process 
sequential data—data that has to be in a specific order—such as 
language. While traditional ANNs process individual date points to give 
an outcome, RNNs maintain the essential structure and relationships 
within sequential data, so that it remains intact. In doing so, RNNs can 
be used to predict the next output of a sequence. They are used 
widely in natural language processing tasks (see pp:112-13), including 
training virtual assistants to carry out spoken conversetions. 
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BUILDING A BRAIN 
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PROCESSING 
VISUAL DATA 


A convolutional neural network (CNN) is a type of deep neural network 
(see p.86) that is similar to the structure of the visual cortex: the part 
of the brain that takes and analyzes information from the eye. CNNs 
are effective tools for computer vision (see p.110), since they can be 
taught to recognize features in input images, such as the pointed ears 
of cats. There are three types of layers (see p.77) ina CNN. The first 
type performs a function called a “convolution,” which allows features 


Input Convolution 

‘The input in CNN is: ‘A filters applied to the image 
typically an image—such to produce feature maps. This 
‘35 photograph of a cat. tnables features to be detected. 
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| get very excited when we discover a way of | 
making neural networks better—and when that's 
closely related to how the brain works.” 

[Sette Hinton af 
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in an image to be detected. These layers first extract low-level features 
(lines and edges), before extracting higher-level features (shapes). They 
work by passing a filter over the image that creates a “map” of the 
location of each feature on the image. Between each convolution, 
there is a “pooling’ layer, which reduces the complexity of the feature 
maps. The data from these layers is flattened, and then passes through 
a “classification” layer (see p.66), which identifies and labels the image. 


Pool Classification Output 
“Mess” is cut out to reduce ‘Througha process of The Alidentifies 
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Featuras aro abstractad. with an image of cat 
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“A world run by 
automatons doesn’t 
seem completely 


unrealistic anymore.” 


USES OF Al 


Whether via mobile phones or virtual 
assistants, we interact with Al applications 
everywhere—often without realizing it. 
Using these applications has changed the 
way we work, shop, and communicate, 
and has revolutionized many industries, 
including finance, healthcare, and 
agriculture. Other Al technologies, such as 
generative Al and autonomous weaponry, 
are still in their infancy, but these too will 
soon be in mainstream use 


APPLICATIONS OF Al 
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RANKING 


‘When an internet search engine is used, Al-generated 
rankings determine which sites appear most highly in the 
results, Same ranking algorithms locate and rank websites 

that contain the same terms, or “keywords”, as those entered 

into the search engine by a user. Those with the closest 
matches rank highest. Other algorithms rank websites more 
highly if they are accessed from many other sites, or if they 
are especially popular. 
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RECOMMENDING 


Based on an internet user’s browsing history, and that 
of others, Al recommendation algorithms can suggest 
websites, as well as products, that may interest the user. 
This can involve suggesting similar content to what the 

user has viewed previously or offering sites that similar 

web users have visited. To do this, algorithms make 
predictions (see pp.70-71). For example, if an internet user 
searches for advice on dog care, the Al algorithm will predict 
that they have or want a dog. It then searches the internet 
to find popular sites, and products, associated with dogs. 


torecommend 
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x” Traditional threat: oe 
detection software used by 
cybersecurity experts searches for 
known malware (see opposite) “signatures,” 
blocks the malware files it detects, and raises 
alerts. Incorporating an Al into the system 
enables cyber defences to identify and categorize 
new and mutated threats ("zero-day* malware) 
that would otherwise be undetectable since they 
do not match any known signatures. This is a vital 
development in cybersecurity, given the speed 
with which new threats arise. Als are also used 
to predict how and where a system might be 
breached, and to help respond to breaches. 


Log Action 

‘The Al logs the malicious ‘The NI performsan action, 
attack for futuro use—eithor such a raising an alarm or 
as historical data or to the blocking traffic, when an 
Istof signatures, attack has been detected, 


INTRUSION DETECTION 


Anomaly detection 

[An Al can monitor incoming potential 
threats by identifying unusual traffic that 
does not match patterns in historical data 


Signatures 
Potential new threats can also be compared 
against alist of attributes, or signatures, 
that are known to belong to malware, 
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Malware 
Short for “malicious 
fF software*malwareis 
jes anycomputer program mmm 
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Disinformation 

Using the internet, 

‘enemy agents can spread 

memes falsenews storiesto 
influence public opinion, 

create instability, and 
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The use of cyberattacks to target a nation state is known as “cyber 
warfare,” It is possible to inflict serious harm on a country remotely, 
disrupting key services and critical infrastructure such as power arids 
by disabling the information systems that control them. Cyber warfare 
tactics include denial-of service (DoS) attacks, malware such as viruses 
and ransomware, disinformation campaigns, and state-sponsored 
hacking. Al is used in cyber warfare to enhance these attacks, making 
them faster and more sophisticated, For example, Al-driven malware is 
very hard to detect: itis able to use machine learning (see p.58-59) to 
find weaknesses in 2 device's security system, attack it while posing as 
an accidental error, and then cause harm to the device. 
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DETECTING FRAUD 


Financial institutions are adopting Al systems to detect—and 
prevent—fraud. These systems can process vast amounts of 
data about past transactions, learning the ordinary patterns 
of behavior of a bank's customers. When transactions are 
made that do not fit this pattern (see p.69), an Al may flag 
‘them up as needing to be investigated or take other actions, 
such as freezing the customer's account. An Al may score 
each transaction on its likelihood of being fraudulent, then 
raise an alert when this score exceeds a certain threshold. 
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AI IN FINANCE 


High-frequency trading (HFT) is the use of specialized 
algorithms to make investment decisions and transactions 

‘at superhuman speed—performing millions of trades each 
day. Some financial institutions manage entire investment 
portfolios using HFT. By evaluating vast quantities of market 
data in real time, it can identify the best stocks and shares to 
buy and sell, identify the optimal time to place those deals, 
and perform transactions extremely quickly. To help inform 
its decisions, HFT may use natural language processing (NLP, 
see pp.112-113) to analyze news reports and social media. 
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JEDICAL RESEARCH 


Als not only speed up tedious work, they help 
‘open up new fields of scientific research. For 
example, using deep learning (see p.86) and 
painstakingly collected experimental data, scientists 
have taught Als to predict the 3D structure of 
“folded proteins"—the building blocks of life—with 
atomic precision. Previously, scientists could not tell 
how a protein's chemistry determined its folded 
structure. This “protein folding problem” was so 
complex that it remained unsolved for decades. 
‘Today, understanding how these proteins work has 
transformed medical research and accelerated the 
process of developing new drugs. 
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MONITORING HEALTH 


Als perform an important role in a new field of medicine known as 
telehealth. By wearing sensors that monitor vital bodily functions, such 
5 oxyaen intake and blood pressure, a person can go about their day 
knowing that if a sensor detects a problem, it will send a signal to their 
digital assistant (an app on their phone or personal computer), which in 
‘turn—via the internet—will alert an Al ata healthcare center. This Al 
will then compare the digital assistant’s report with previous data 
about the person and alert a physician if necessary. Crucially, this 
technology can detect problems that a person may not even be 
aware of. More generally, Al technologies can also be used to 
monitor people’s general fitness and well-being. 
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INTERNET OF THINGS 


The “Internet of Things” (IOT) is the network of interconnected 
devices that collect and exchange data via the internet—not only 
phones and computers, but also smart refridgerators, driverless cars, 
fitness monitors, security cameras, and tens of billions of other items. 
Due to the vast amounts of data that these devices collect—and the 
requirement that they respond appropriately to their users and 
environment—Als have become integral to the IOT. A smart energy 
meter, for example, may use an Al to identify patterns in a user’s 
energy consumption and suggest adjustments to reduce their bills. 
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‘Smart home 
Domestic appliances 
and even heating and 
lighting systems are 
neressingly internet 
enabled, extending 
the reach of the IOT 
throughout the home. 
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Embedded learning 
The Al learns within the devic 
using data collected by its sensors, 


SMART DEVICES 


The “intelligence” in the Internet of Things 
(1OT, see opposite) is mostly contained within 
clouds—remote computing systems usually owned 
by technology companies. Increasingly, however, Al 
software capable of machine-and deap learning Is 
being embedded in devices, such as mobile phones 
and smart watches. Using embedded Al removes 
the need to send data to and from the cloud 
continuously, reducing power usage, data 
processing time, risk of data breaches, and reliance 
‘on cloud providers. In real-time monitoring devices 
(see p.106), embedded Al allows almost instant 
detection and response. 
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10: 


MONITORING SYSTEMS 


‘The “Internet of Things” (see p.104) enables Als to monitor 
all kinds of equipment automatically, up to major infrastructure 
systems, such as gas pipelines, transportation networks, and electricity 
grids, Sensors distributed throughout these systems collect and 
transmit their data to Als, which then scan the data for anomalies 
(see p.69) and alert human technicians to investigate them further if 
necessary. Als are also used to predict where faults could occur in the 
future, enabling technicians to take action to prevent equipment 
failure. Such measures minimize the disruption caused by using 
complex equipment that needs regular maintenance, 
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Monitoring pressure 
Sencare manitar the pressure within a water pips and wirslesely 
transmit their data to an Al. Here, the Al detects an anomaly; the 
pressure i lower than it should be between sensors 2 and 3. 
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“SMART” FARMING 


Alisa key technology in "precision agriculture’—an approach to 
farming that optimizes the use of water and other resources in order 
to increase yields and minimize waste, Using devices such as drones in 
the air and robots on the ground, which collect data analyzed by Als, 
farmers can receive real-time information about their crops, enabling 

‘them to know which of them require water, pesticide, or fertilizer 
at any time, Such precise methods of farming may become 
indispensable in the coming decades, when the global population 
is set to increase by two billion people. 
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SENSORY Al 


Akey aspect of human intelligence is the ability to perceive the 
world through sight, hearing, touch, smell, and taste, Machine 
perception is the ability of computers to sense their surroundings 
Via dedicated hardware (such as cameras and micraphones), and 
to interpret the collected data and react appropriately, This allows 
computers to receive information from sources other than a 
keyboard and a mouse, which is a step toward aligning Al with 
human intelligence, Machine perception, which is vital for 
‘embodied Al (see p.118), includes computer vision (see p.110), 
machine hearing (see opposite), machine touch, machine smelling, 
‘and machine taste. 


108 | MACHINE PERCEPTION 


TRAINED 
NEURAL NETWORK ouTpuT 


“lit — 
o abut — yw 


Will] __ prem 


—— si fff 


‘A neural network trained through 
labeled or unlabeled data (see 
pp.72-73) can identify different 
Instruments in the mixture of sounds. 


PROCESSING SOUND 


Machine hearing is the ability of a computer to sense and process 
audio data, such as music or human speech. This interdisciplinary field 
employs both classical (see p.35) and statistical (see p.57) approaches 
to Al. Engineers developing machine-hearing technolagies attempt to 

replicate the abilities of the brain that people typically take for granted, 

such as focusing on a specific sound amid backaround noise. Speech 
recognition is a complex subfield within machine hearing. It aims to 
comprehend meaning in spaken language, often using 
deep learning (see p.86) to train models, 
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‘Computer vision is the ability of a computer 
to recognize images and videos—for example, to 
understand that a certain arrangement of pixels 
is associated with a picture of a cat. Engineers 
working in computer vision alm to automate the 
tasks performed by biological visual systems, such 
as the human eye and parts of the nervous system, 
The rise of deep learning (see pp.86) using 
multilayered artificial neural networks (ANNs, see 
76) and the availability of very large training data 
sets online have greatly advanced the field. 
Computer vision is used in many areas, including 
facial recagnition (see oppasite) 
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FACIAL RECOGNITION 


Facial recognition is a form of computer vision technology 
(see opposite) that matches photographs or videos of human 
faces to those stored in a database. An image of a face 
is captured, and its distinctive features, such as 
the distance between the eyes, are mapped to 
create a unique “faceprint” that is then compared 
with known faceprints, Facial recognition is mainly used for 
security, such as an authentication process on phones, and law 
enforcement, such as to 
identify someone from 
a database of known 


offenders. 


FEATURING MAPPING | 111 


UNDERSTANDING 
WORDS 


The ability of computers to “understand” and 
generate natural language—that is, language as it is 
typically spoken and written by humans—is a key 
‘element of mimicking human intelligence. This idea 
lies at the heart of the Turing test (see pp.130-131). 
Natural language processing (NLP), the research field 
dedicated to developing this ability, brings together 
Al, linguistics, and other disciplines. In the 1950s, 
researchers tried to emulate “linguistic intelligence” by 
providing computers with collections of handwritten 
language rules. More recently, the explosion in 
computing power and big data (see p.33) has enabled 
machine learning—particularly deep learning—to be 
integrated into NLP with impressive results. Among its 
many applications, NLP is used in machine translation 
(Gee p.114) and virtual assistance (see p.116). 


Elements of NLP 
There are five elements to 
NLP, which involve arranging 
letters into words and 
Interpreting the intended 
‘meaning of sentences. 
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Discourse integration Pragmatic analysis 
ay ‘The meanings ofconeecative Pragmatic analysis goes beyond 
| sentences are considered tetera meaning of re words 
we ‘together to give context. and attempts to interpret their 

| towerds and phrases. intended meaning. 
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Al INTERPRETERS 


Machine translation (MT) is the use of Al in the automated 
translation of text or speech from one language to another. Translation 
isa far more complex and subtle matter than simply substituting each 
word for its equivalent in another language. Consequently, MT is 
currently used more as a tool than asa replacement for human 
translators, There are three broad approaches: “rule-based MT” relies 
on linguistic rules, such as grammar and syntax; “statistical MT” uses 
‘the known relationships between words to predict whole sentences 
and phrases; “neural MT” uses artificial neural networks (ANNs, see 
p.76) trained to understand languages almost as well es people do. 


MACHINE TRANSLATION IN ACTION 


Rule-based MT 
)) This approach gives a quick but basic 
translation. Text and speech can be 
understood but often require 
further editing, 


‘Statistical MT 
‘This approach predicts words 

‘and sentences, and may not be fully 
accurate. The translated text often 
till equires furthor editing 


\ Neural MT 
) Atrained ANN ic accurate and can 
be constantly improved. Training an 
J ANN requires huge amounts of data 
‘andis very costly. 
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“Robot, do need to take 
‘an umbrella today?" 


“Yes. iti ikely to rain, 
Take a coat, to0." 


TALKING WITH AI 


Chatbots, such those employed by virtual assistants (see p.116), are 
programs that can carry out conversations via text or text-to-speech. 
Natural language processing (NLP, see pp.112-113) helps chatbots 
mimic how humans talk during conversations. Based on classical Al, 
statistical Al, or a combination of the two (see pp.54-SS), chatbots can 
range in sophistication. Businesses often use basic chatbots to answer 
simple customer queries instead of providing immediate contact with 
a human employee. The most sophisticated chatbots, such as ELIZA 
in the 1960s, can give the impression of intelligence. 
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Al HELPERS 


A virtual assistant is a software application or device that uses 
machine hearing (see p:109) and natural language processing 
(NLP, see pp.112-113) to perform tasks an command, such as searching 
tthe internet, playing music, or setting timers and alerts. Basic virtual 
assistants are essentially chatbots (see p.115), while more complex 
models can interact with other “smart” devices, via the Internet of 
Things (see p:104), to activate systems such as domestic lighting and 
heating. Many virtual assistants are cloud-based, and continually 
use voice data for training, which enables them to get better 
at predicting a user's needs and preferences, 
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CREATING 
AN IMAGE 


Al ARTISTS 


Generative Al is the field dedicated to synthesizing new 
content, such as images, audio, text, or video, based on an 
Input in any of these formats. For instance, a generative Al 
model could be trained to produce an image of a cartoon 
giraffe when prompted with the text input “cartoon airaffe.” 
Al image-generation has existed since the 1960s, and can 
use a variety of classical and statistical techniques. Recently, 
however, generative adversarial networks (GANS, see p.87) 
have proved such effective “artists” that they have prompted 
debate about whether art can be considered uniquely human. 
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INTELLIGENT ROBOTS 


A\s that are designed to interact physically with their environments 
are known as “embodied Als.” Such Alls, which include robots, mimic 
not only human cognitive intelligence, but human physical behavior as 
well. They do so with the help of sensors, motors, and other hardware, 
which enable them to perceive (see p.108), move in (see p.120), and 
affect (see p:121) their three-dimensional environments. Constructing 
such machines is an important step forward in Al, since much of what 
we consider to be intelligence in human beings involves our ability to 
interact with our surroundings. Embodied Als already include robotic 
vacuum cleaners and lawn mowers. 


EMBODIED Al 


Al COMPANIONS 


social robot is an embodied Al (see 
opposite) that is capable of interacting socially 
with humans, using speech, movement, facial 
expressions, and other humanlike behaviors. 
Social robots are limited as companions, since 
it is difficult to replicate many basic human 
abilities, such 2s manipulating objects (see 
pp.52-53) or understanding tone of voice. 
While largely treated as a novelty, they are 
nevertheless sometimes used in health and 
social care to alleviate loneliness, depression, 
and anxiety. Although they can come in 
any shape and size, most social robots 
are humanoids. 
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MANUAL DEXTERITY 


One of the greatest challenges in robotics is building 
machines that can interact physically with their 
environments. To perform even the simplest human 
action, such as picking an apple, a robot must have an 
excellent sense of sight, as well as a sense of touch, 
which enables it to apply just the right amount of 
pressure to manipulate objects correctly. Many such 
robots are currently used in controlled settings, such as 
factories, but they may soon be sophisticated enough 
to help with domestic chores in people’s homes. 
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DRIVERLESS CARS 


Autonomous vehicles are examples of mobile robots (see p.120). They 
use systems incorporating sensors, Al, and actuators (see p.27) to assist 
or wholly replace the human operator of a vehicle, whether on land or 
sea, or in the air. “Seif-driving” or “driverless” cars are a category of 
autonomous vehicle under development, and cars incorporating 
semi-autonomous technology are now available. Their arrival on 
roads is raising complex legal and ethical questions, such as who would 
be responsible for accidents caused by Al-controlled cars (see p.152). 
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Al AND 
WARFARE 


Military needs drive much Al innovation. This has 
Y +. oe led to the creation of sophisticated autonomous 
systems that can perform military tasks with little 

‘or no human intervention. Some, including 

reconnaissance drones, are nonlethal. Others, 
such as sentry guns, are deadly weapons in their 
‘own right, capable of Identifying, locking onto, 


rt and firing at targets. There is much debate over 
was 


whether to ban the deployment of lethal weapons 
that are fully autonomous—those that enable 
rapid response by removing the need for a human 
to give the final order to attack. 
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PHILOSO 
ARTIFIC 


INTELLI 


Als are designed to mimic human behavior—to calculate 

the way we do things, or, in the case of androids, to interact 
with the environment with humanlike aglity. However, as Als 
become more and more sophisticated, the question arises as 
to Where we should draw the line between the human and the 


artificial. Or, to ask the question another way: at what point 
should we say that an Al is, in fact, a person—has all of the 
qualities that a human has, and so should be aranted rights? 
The philosophy of Al addresses this central question. It 
examines the concepts of free will and consciousness, and asks 
what the difference is between an intelligence that has evolved 
biologically and one that has been built by human beings, 


HUMANLIKE 
Al 


For many scientists, building an AGI (artificial general 
intelligence) is the ultimate goal of Al research— 
although it may never be achieved, An AGI would be 


as intelligent as a human being, and may even have 
‘other human faculties, such as emotions or even 
consciousness, Ancther name for AGI is "strong Al,” a 
name that contrasts it with “weak Al,” which refers to 
all other Als that are built to perform specific tasks. 
Unlike a weak Al, an AGI would have something li 
intuition—the ability to know that somethi 
true without resorting to co 
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THE POINT OF 
NO RETURN 


In cosmology, a “singularity” is a point in space at 

which the familiar laws of physics break down, creating a 
phenomenon known as a “black hole’. In Al, the singularity 
is the name given to the point in time at which a machine 
will become as smart as the people who built it, and 
therefore clever enough to improve itself. Such a machine 
would be able to operate at the lightning speeds of a 
supercomputer, and s0 would swiftly achieve incredible 
abilities—including the ability to design Als itself. The 
singularity could therefore transform the world in 

ways that we simply cannot predict. 
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WHERE IS CONSCIOUSNESS? 


For centuries, philosophers have debated the question of how the 
mind and the brain interact—or, more broadly, how such a thing as 
consciousness can even exist in a physical world. The debate intensified 
in the 17th century, when scientists proposed that the universe is like 
‘a machine—a clockwork mechanism whose workings are in principle 
predictable. However, German philosopher Gottfried Leibniz 
(1646-1716) argued that if the physical world is mechanical, then the 
human brain must be linked to the rest of the body by the biolagical 
equivalents of cogs and pulleys, But if that is the case, he argued, 
then there is no place in the brain for consciousness, which he 
believed cannot be explained mechanically. 
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GAME eee 


Alan Turing (see pp.18-19) 
devised a test, now called 
the Turing test, that 
provides a means for 
judging whether or not a 
machine Is intelligent. The 
test is based on a Victorian 
parlor game, in which one 
person tries to figure out 
whether another person, 
who is hidden behind a 
screen, is male or female, 
judging by the answers 
they give to certain 
questions. In the Turing 
test, both a human and 

a computer are hidden 
behind a screen, and an 
examiner supplies them 
with mathematical 
problems to solve. If 

both sets of answers are 
correct, then the examiner 
cannot say which are the 
computer's and which are 
the human's. The computer 
has therefore passed the 
test, and can be said to. 

be intelligent. 


+ the same questions. 
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MACHINES AND 
UNDERSTANDING 


American philosopher John Searle (1932-) rebutted 
the idea that machines can think by arguing that 

while machines follow rules, they are incapable of 
understanding them (see pp.130-131). In what he 
called the "Chinese Room” thought experiment, 

Searle imagined a person in a room receiving questions 
written in Chinese. If the person had the appropriate 
rule book, they would be able to reply to the questions 
in writing, without actually understanding either the 
questions or the answers. Searle argued that to say 
that a computer can think is similar to saying that 

the person in the example understands Chinese, 


CHINESE NON CHINESE CHINESE 
SPEAKER SPEAKER SPEAKER 
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A NEW KIND 
OF PERSON 


Many scientists argue that, one day, Als will be so lifelike that they 
should be treated like human beings. They claim that since humans 
have rights on the basis that they have free will, Als that pass a 
“free will test” should therefore have the same protections under 
law. This means that, in the future, an Al could claim ownership 

of its intellectual property, and even be penalized for making 
mistakes. Legally, such an Al would no longer be a machine, 

but a person—effectively, a new kind of human being. 
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Personhood 
‘fan Als troated ike a person, 
it may be granted both rights 

and responsibilities. 
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REPLICATING 
THE MIND 


According to the principle of multiple realizability (see p.20), 

the same computer programs can be run, or “realized,” on 

different devices. Computationalists (see p-12) argue that 

human thought is computable, and so can be realized by a 

machine as well as 2 brain. if this is true, then it should be 

possible to write a program that replicates a human mind, 

which could then be copied and transferred like any other 
program. This means that a person’s mind could be 
uploaded to a remote server, downloaded to a robot, 

and even duplicated innumerable times. 
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Human-human interaction 
‘An Indlvidual knows what they 
themselves mean by “beetle,” but 
they cannot be cure that it means 
the same thing to someone else. 


TRANSPARENT THINKING 


The philosopher Ludwig Wittgenstein (1889-1951) argued that a 
person’s thoughts were like objects in a closed box—a box into which 
only they could “see.” We can never know what another person is 
really thinking or exactly what things mean to them, since the 
“box” is closed to us, A machine intelligence, however, could be 
examined in ways a human mind cannot. If the machine said it was 
thinking of a beetle, its programming could be exposed—“opening 
‘the box"—to show precisely what it meant by “beetle.” Such 
developments might, in turn, shed light on the mechanisms 
‘of human consciousness and thinking. 
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GARBAGE IN, GARBAGE OUT 


Machine-learning networks (see pp.58-59) are only as 
good as the data on which they are trained, The most 
‘common cause of inaccurate results from an Al system 
is poor-quality training data, which includes input data 
that is incomplete, poorly labeled, full of errors, or biased 


(see opposite). For exemple, predictive Al systems 
(6ee pp.70-71) trained on inconsistent and incorrect 


historical data will produce useless predictions. in the field 
‘of computer science, the idea that bad inputs produce 
bad outputs is informally summarized as “garbage in, 
garbage out,” or “GIGO.” 
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‘The term “Al bias" |s used to describe Al systems that produce unfair 
results for particular groups of people. Al biases often reflect 
prejudices in society about gender, ethnicity, culture, age, and 
many others. Bias usually stems from the programmers themselves, 
via their algorithms and their interpretations of results, and from 
the data sets used to train an Al (see opposite). To combat this, 
programmers test their models to ensure that societal bias is not 
reflected in their results and use data sets that are representative. 
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MAKING ASSUMPTIONS 


Using personal date to try to predict an individual's future desires, 

opinions, and activities is known as “profiling.” In Al, machine-learning 
tools can be trained on large data sets to become expert at predicting, 

for example, the kind of internet content a user might like to see 
based on their viewing history. However, profiling can be problematic, 
since it can lead to false and even damaing predictions due to biases 

built into data sets and algorithms (see p.143). In order to root out 

such biases, it is essential that Al decision-making processes 
be transparent (see opposite), 
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TRANSPARENT PROCESSING 


Machine learning models process data and make predictions 
using highly complex artificial neural networks (ANNs, see 
p.76). The inner workings of these models are often said to be 
a “black box” because they are too complicated and abstract 
for humans to “observe.” This means that the results they 
produce cannot be properly understood and checked for 
errors or biases. An alternative approach, known as 
“interpretable machine learning,” or “white box Al,” shines a 
light into the black box. White box Als are designed to give 
not just the result, but a breakdown of the processes 
they followed to reach it. 
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AN Al 
WORKFORCE 


The replacement of human beings by machines in the 
workforce is known as “technological unemployment”. Up until 
now, this phenomenon has not led to mass unemployment, because 
machines greatly increase productivity, which in turn stimulates the 
economy and creates new job opportunities, However, if Als begin to 
pass the employment test (see p.132), and achieve the intelligence level 
of AGIs (See p.126), then, one day, there may be few jobs left for human 
beings to do. Under such circumstances, the challenge for 
governments will be how to support the masses of unemployed 
people, which may include providing universal income—a regular 
payment to each member of society. 
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‘Al has the potential to increase productivity and generate 
Income and opportunity. Shared by all, these benefits could 
create a more equal world, but if concentrated in the hands 
of the wealthy and powerful, the gulf between rich and poor 

will widen. Bias in design, data, and how and where Als are used 
can exacerbate social divides, increase inequality, and lead to 
hazardous and discriminatory applications. Attempts to mitigate 
these risks include inclusive design and embedding Als with 
values, such as fairness and accountability. 
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Al algorithms are increasingly 
used to curate the content people 
see online—for example, on social media. An 
unintended consequence of this has been the 
creation of “filter bubbles,” whereby people 
are shown only content that tallies with or 
amplifies their own opinions, while alternative 
views get filtered out. This occurs due to 
“recommendation algorithms® (see p.95) 
repeatedly showing users material similar 
to what they have viewed in the past, 
encouraging biased thinking. 
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RIGHT VS. WRONG 


As Als become ever more intelligent, the question of how 
toensure that they behave ethically becomes increasingly 
important (see opposite). Machine-learning tools have 
neither agency nor values, and so cannot be relied upon to 
offer suggestions that are in the best interests of humanity, 
‘or do not favor one social group over another. The only way 
to ensure that Als think ethically is to program them with 
ethical principles, although then the question becomes: 
whose ethics? Ideally, an Al should have equal respect for all 
humans, and be able to detect and compensate for bias 
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BUILT-IN 
ETHICS 


‘One way of ensuring that Als behave ethically (see opposite) is to 
program them with specific ethical rules or laws—a process known 
as “terminal value loading.” The classic illustration of this can be 
found in the science fiction stories of Isaac Asimov (1920-92), who 
formulated what he called the “three laws of robotics” (see above). 
However, as his stories explore, terminal value loading is far from 
foolproof, since even the simplest laws can generate contradictions. 
For example, an Al may be instructed not to harm a human being, 
but doing so may be the only way of saving a person's life. 
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Some researchers argue that, one day, Als will 
not only be as intelligent as human beings, they 
will also have humaniike personalities, and so 
should be granted human nghts (see p.135). 
If an Al is given such rights, lawmakers would 
have to decide where to draw the line between 
holding the Al or its makers responsible for its 
actions. if the Alis deemed culpable for breaking 
the law—in other words, that it acted on its 
‘own free will—then it would have to suffer the 
appropriate sanctions or punishments for i 
actions, Like a human being, it could also be 
required to make amends for what it has done, 
and be open to reforming its character. 
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Concerns about the dangers that Als may pose in the 
future have fueled calls for Al research to be regulated. 
However, many scientists argue that regulating research 

will stifle innovation, and give unregulated countries a 
dangerous advantage. A compromise, proposed by European 
regulators, is to scale regulation according to risk. Low-risk 
applications of Al should have little or no regulation; high- 
risk applications should be controlled: and the most 
risky applications should be forbidden, 
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EXISTENTIAL RISKS 


One possible threat posed by Al is known as the “alignment 
problem,” whereby the goals and values of an Al do not align 
with those of humanity. Named after a scene in the Disney 
cartoon Fantasia, in which a sorcerer’s apprentice makes 
a broom multiply uncontrollably, “Sorcerer's Apprentice 
Syndrome” neatly illustrates the problem in the form of a 
thought experiment. An Alis given the task of optimizing 
the production of paperclips, but believes that its job is only 
done when it has converted the entire planet into paperclips. 
It does so because it does not realize that it must prioritize 
human life over paperclip production. 
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UNLIMITED REWARDS 


Many Al researchers believe that Al will usher in a golden age 

for humanity—a time when machines will generate limitless 
abundance and prosperity. They argue that, with more powerful 
Als doing all the work that humans used to do, people will finally 
be free to devote their time to leisure activities and to pursuing 
their personal dreams. At such a time, they claim, there will be no 
scarcity of resources, and so no crime, war, or injustice—and Als 
will be able to help us to solve the world’s remaining problems, 
from disease to climate change. 
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